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Abstract: 

We discuss the impact of the treatment of NMC structure function data on parton 
distributions in the context of the NNPDF2.1 global PDF determination at NLO and 
NNLO. We show that the way these data are treated, and even their complete removal, 
has no effect on parton distributions at NLO, and at NNLO has an effect which is below 
one sigma. In particular, the Higgs production cross-section in the gluon fusion channel 
is very stable. 
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Fixed target deep-inelastic scattering data provide important constraints on parton 
distributions (PDFs) and are routinely included in PDF determinations. It has been 
recently suggested [T] that the results of current PDF determinations depend strongly on 
the treatment of the fixed target DIS data obtained by the NMC collaboration ^3}- in 
particular, according to whether data for cross-sections or structure functions are used in 
the fit. The substantial changes in the gluon distribution and as {Mz) found in Ref. [1] 
lead to a large shift in the Higgs production cross-section, which would, if correct, have 
very significant implications for Higgs searches at the Tevatron and LHC. This claim has 
generated an ongoing discussion on the adequacy of current Higgs mass limits [4|l5] : besides 
its interest in this context, the issue is relevant for the understanding of the comparative 
merits of PDF determinations based on a wider dataset (which contain more information 
but might be less consistent) and those based on a more limited but more consistent set 
of data. 

In this note we examine this issue within the context of the NNPDF2.1 NLO [6] 
and NNLO [7j PDF determinations. In contrast to the ABKM [8] determination, on 
which the results of Ref. [1] are based, NNPDF2.1 depends on a rather broader dataset, 
and uses the especially flexible NNPDF methodology (for a review see e.g. Ref. [9]), 
making it less vulnerable to parametrization bias. Related results (consistent with our 
findings) have been presented recently in the context of the MSTW [5] and CTEQ [10) 
PDF determinations. 

The kinematic coverage of the NMC data is compared in Fig.[T]to that of other datasets 
used to determine NNPDF2.1 PDFs: the other fixed target DIS data, the HERA colhder 
data, the fixed target Drell-Yan and Tevatron weak vector boson production data and 
the Tevatron inclusive jet data. We will now consider variants of NNPDF2.1 in which 
the NMC data are treated in different ways. In all other respects, we adopt the default 
settings of NNPDF2.1 as discussed in Refs. [SlITj. In particular, we take a fixed value 
for the strong coupling in both the NLO and NNLO fits, Ug {Mz) = 0.119, close to the 
PDG average [13]; sets with variable as{Mz) are also available [6l[71[lll[T5] , from which 
combined PDF+a^ uncertainties can be computed |16lll7j . 

The NMC collaboration has measured the neutral current deep-inelastic muon-nucleon 
cross-section 

^(-,2/,Q^) = 1^ [Y^Fi^'^{x,Q')^Y.xFi^^ix,Q')-y'Fi!^ix,Q')] , (1) 

where 1± = 1 it (1 — y)^. For NMC <^ so the parity-violating structure function 
xF^ can be neglected and only the electromagnetic components of F2 and Fl are relevant. 
It is convenient to define a reduced cross-section 
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-1 ^2^NC , „,2 



(x,y,Q^) = Fi^^(x,Q^) 2-2y + 



(2) 

where 

Rix.Q"-) = FLix.Q")! [F^ix.Q') - FLix.Q")) . (3) 

Equation ([2]) was used by the NMC collaboration [2l[3] to extract F2 from the measured 
cross-section Eq. ([T]), using for x < 0.12 a determination of R{x, Q^) from their own data. 
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NNPDF2.1 dataset | 




Figure 1: Kinematic coverage of the NMC data, compared to those of the other datasets in the 
NNPDF2.1 global analysis: the non-NMC fixed target DIS data, the HERA collider data, the fixed 
target DrcU-Yan and Tevatron weak vector boson production data and the Tevatron inclusive jet 
data. 



and for a; > 0.12 a parametrization iJiggo of R [llj obtained from a global fit to SLAG 
structure function data |12j . 

In all NLO NNPDF parton determinations p [T5l[T8H20] the NMC structure function 
data was used, both for the proton structure function F2 and the ratio of deuteron to 
proton structure functions, F2/F2. It may be reasonably argued however that data for 
the reduced cross-section, which is closer to what is measured experimentally, should be 
used instead. Note that the distinction is only relevant for the F2 data: since the isotriplet 
component of Fl (rr, Q^) is very small, BP{x,Q^) R'^{x,Q'^), so 

In Fig. [2] (to be compared to Fig. (1) of Ref. [S]) the form of R{x,Q'^) used by 
NMC (shown as black dots) in both regions is compared to the prediction obtained using 
NNPDF2.1 NLO and NNLO PDF sets. The parametrization Riggo does not come with 
an uncertainty, however the typical size of the uncertainty on it can be inferred by com- 
paring it to the data of Ref. [12] on which it is based, also shown in Fig. [2l Note that 
the SLAC data are concentrated at low Q^, hence in most of the NMC kinematic region 
this parametrization is an extrapolation and thus subject to very large uncertainties. It 
is clear from Fig. [2] that (as emphasized in Ref. |8]) the R values used by NMC at low x 
do not agree well with the prediction from the use of modern PDF sets, while instead the 
parametrization -Riggo is in good agreement with the NNPDF prediction within the large 
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Figure 2: The NNPDF2.1 NLO and NNLO results for R{x,Q^) Eq. © at a; = 0.11 (left) and 
X = 0.14 (right), compared to the values of R used by NMC in Ref. [5], and the SLAG data of 
Ref. [H] on which the parametrization [TT] used by NMC for x > 0.12 is based. 




Figure 3: The NNPDF2.1 NLO and NNLO predictions for the NMC reduced cross-sections for 
X = 0.05 (left), X = 0.11 (center) and x = 0.225 (right ), compared to the NMC data [2]. 



uncertainty of the data on which it is based, especially if NNLO theory is used. Thus 
the use of NMC cross-sections instead of structure functions (which rely on these partly 
inadequate assumptions on R) does indeed appear to be in principle more advisable. 

However, it is unclear whether in practice the effect of this replacement may be sig- 
nificant, especially in view of the fact that the NMC data are known to have internal 
consistency problems, as shown long ago in Ref. [22]. To illustrate this, in Fig. [3] the 
NMC reduced cross-section data are compared to NLO and NNLO predictions obtained 
using the corresponding NNPDF2.1 PDF sets. It is clear that the data show larger point- 
by-point fluctuations than one would expect from their nominal uncertainties, thereby 
suggesting that the effect of the treatment of the relatively small i?-dependent correction 
might be moderate on the scale of these fluctuations. 

In order to settle the issue quantitatively, we construct and compare, both at NLO and 
at NNLO, three PDF sets: one in which NMC data for the proton structure function F2 
are used, one in which data for the proton reduced cross-section are used (supplemented, 
in both cases, by data for the ratio F2 /F2), and one in which NMC data (both for proton 
and the deuteron/proton ratio) are removed altogether from the global dataset. In all 
cases sets of Nj-^p = 100 replicas have been produced. Note that the published (default) 
NNPDF2.1 sets [6lll] use NMC structure functions at NLO and NMC cross-sections at 
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NNPDF2.1 NLO 


NNPDF2.1 NNLO 




str. fctn. 


xsec. 


noNMC 


str. fctn. 


xsec. 


noNMC 


Total 


1.16 


1.14 


1.09 


1.16 


1.16 


1.12 


NMC-pd 


0.97 


0.98 


- 


0.93 


0.93 


- 


NMCp 


1.73 


1.67 


- 


1.69 


1.63 


- 


SLAC 


1.27 


1.27 


1.28 


1.05 


1.01 


1.00 


BCDMS 


1.24 


1.23 


1.18 


1.29 


1.32 


1.27 


HERAI-AV 


1.07 


1.05 


1.07 


1.12 


1.10 


1.08 


CHORUS 


1.15 


1.11 


1.07 


1.12 


1.12 


1.12 


FLH108 


1.37 


1.34 


1.38 


1.27 


1.26 


1.29 


NTVDMN 


0.47 


0.51 


0.42 


0.50 


0.49 


0.50 


ZEUS-H2 


1.29 


1.23 


1.24 


1.32 


1.31 


1.30 


ZEUSF2C 


0.78 


0.74 


0.72 


0.88 


0.88 


0.89 


H1F2C 


1.51 


1.48 


1.49 


1.47 


1.56 


1.52 


DYE605 


0.85 


0.93 


0.88 


0.81 


0.81 


0.81 


DYE866 


1.27 


1.40 


1.34 


1.31 


1.32 


1.34 


CDFWASY 


1.85 


1.87 


1.60 


1.55 


1.65 


1.41 


CDFZRAP 


1.62 


1.76 


1.64 


2.16 


2.12 


2.18 


DOZRAP 


0.60 


0.57 


0.56 


0.67 


0.67 


0.67 


CDFR2KT 


0.97 


0.73 


0.81 


0.79 


0.74 


0.80 


D0R2CON 


0.84 


0.90 


0.96 


0.84 


0.82 


0.84 



Table 1: The of NNPDF2.1 PDFs at NLO and NNLO when NMC data are included as structure 
functions, reduced cross-sections, or not included. 

NNLO. 

In Table [J we compare the values obtained in these three fits, both for the global 
fit and individual experiments. The quality of the global fit is unchanged at NNLO and 
improves slightly at NLO when the structure function data are replaced by cross-section 
data, and in both cases the quality of the fit to NMC data improves slightly, with the 
quality of the fit to other data unchanged. This suggests that the use of cross-section 
data is indeed somewhat more consistent for NMC, but also that this has little or no 
effect on other experiments. When the NMC data is removed altogether, the global fit 
quality improves, due to the fact that Xnmc rather poor in view of the aforementioned 
inconsistencies, regardless of how NMC data are treated. In particular, the fit to the 
BCDMS data, which measure the same structure functions as NMC in a partly overlapping 
region, improves somewhat when the NMC data are removed. 

We now compare the PDFs obtained in the various cases. In Fig. U (NLO) and Fig. [5] 
(NNLO) we show the distances (as defined in Ref. [IS]), computed both for central values 
and uncertainties, between PDFs in the sets with NMC cross-section vs. structure function 
data, and a set without NMC data vs. the the default NNPDF2.1 set. Recall that d ~ 1 
corresponds to statistically indistinguishable results, while, for sets of 100 replicas, d ~ 7 
corresponds to a shift by one sigma (i.e. results are statistically distinguishable, but 
compatible within uncertainties). 

These plots show that at NLO the replacement of structure functions with cross- 
sections is at the level of statistical fiuctuations. At NNLO a small, statistically significant, 
shift in central values and uncertainties at the level of at most a third of a sigma but mostly 
lower is seen in some PDFs (specifically the quark singlet and isospin triplet and the gluon) . 
The effect of removing NMC data altogether at NLO is again almost indistinguishable from 
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Figure 4: Distances (defined as in Ref. [15]) between NLO PDF sets with NMC structure functions 
and NMC cross-sections (top) and PDF sets witli NMC structure functions and PDF sets witliout 
NMC data (bottom). All distances have been computed using sets of A^rep = 100 replicas. 



a statistical fluctuation with the possible exception of the quark singlet for 0.02 < x < 0.5 
which shows a shift by little more than a quarter of standard deviation (though this could 
be a statistical fluctuation due to the size of the rephca sample). At NNLO instead the 
effect of removing the NMC data altogether is clearly statistically significant on the isospin 
triplet and gluon, corresponding to a shift in central values at the level of almost one sigma 
for the gluon and more than half sigma for the isospin triplet. A one sigma change of the 
triplet uncertainty is also observed. 

Some of these PDFs at NLO and NNLO are compared in Figs. [6] and [7] respectively, 
at a typical electroweak scale = 10^ GeV^. Differences at higher scale are somewhat 
reduced because of asymptotic freedom, but the general pattern observed in the distance 
plots is clearly reproduced: at NLO replacing NMC structure functions with cross-sections 
has no effect, while at NNLO it has an effect which is above the threshold of statistical 
significance, though smaller than the change that would be observed if the data changed 
by an amount compatible with their uncertainties. The effect of removing NMC data 
altogether, both at NLO and NNLO, is qualitatively similar but quantitatively somewhat 
larger. 

We conclude that at NLO replacing structure functions with cross-sections or even 
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Figure 5: Same as Fig.|4|but at NNLO. Note that in tliis case tiie PDF set witli NMC cross-section 
data (NNLO default) is used for the comparison witlr the PDFs with no NMC data. 



removing NMC data altogether has no effect, while at NNLO replacing structure functions 
with cross-sections is just above the threshold of statistical significance, and removing 
them altogether statistically significant, though in all cases below the effect of a one sigma 
change of the data: this can be viewed as an upper bound on the possible impact of the 
treatment of this dataset. 

The main implication of the study of Ref. [1], and the reason for the the ensuing 
debate, was that the Higgs production cross-section via gluon-gluon fusion may change as 
a consequence of the treatment of NMC data by an amount which may invalidate current 
Higgs exclusion limits. To verify what happens in our case, we have recomputed the 
Higgs production cross-section using the various PDF sets discussed here, using the code 
of Refs. |231I24|. for a range of Higgs masses between 100 and 400 GeV. We show results 
for the Tevatron and the LHC 7 TeV in Fig. [8l all uncertainties shown are 68% confidence 
levels. We see that the replacement of NMC structure functions by cross-sections has no 
impact on the Higgs production cross-section, and that even removing all NMC data leads 
to a shift much smaller than the nominal PDF uncertainties, with a slight increase of these 
uncertainties. Again, this can be viewed as a (conservative) estimate of the differences 
arising from the different treatments of the NMC data. 

Let us finally compare our results with those of Ref. [1]. In that reference, the value 
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Figure 6: NLO PDFs determined using NMC cross-section data (long dashes) and no NMC data 
(short dashes) shown as ratio to the default NNPDF2.1 set (determined using NMC structure 
function data) at = 10^ GeV-^: singlet E(a;, Q^) (top, left), gluon g{x,Q^) (top, right), triplet 
TsiXjQ^) (bottom, left) and total valence V{x,Q^) (bottom, right). 



of as was determined together with the PDFs, and the best-fit ag was found to change 
significantly according to the treatment of the NMC data. In particular, the change of 
the best-fit as value was found to be of order of 1.5 sigma at NLO and 2.3 sigma at 
NNLO, with an increase of the Higgs cross-section at the LHC by 4% (i.e. about one 
sigma) at NLO and 9% (i.e. about 2.7 sigma) at NNLO when the NMC cross-section 
data are replaced by structure function data. In order to assess quantitatively how much 
of this change in Higgs cross-section is just due to the different value of as a comparison 
between ABKM sets with fixed value of as but different treatment of NMC data would 
be necessary. These sets are at present not available. However, a simple estimate (which 
at NLO is in fact quite accurate [13]) can be obtained by noting that, based on the size 
of the NLO and NNLO iiT-factors one expects that a percentage change Aa in the value 
of as, if everything else is kept fixed, leads to a percentage shift of the Higgs cross-section 
Aa ~ 2.5Aa at NLO and Aa ~ 2.8Aa at NNLO. Based on this, one would estimate that 
about 90% of the cross-section increase seen in Ref. [1] at NLO when structure function 
data replace cross-section data and about 80% of the increase at NNLO, is just due to the 
change in value of a^. The residual change, due to the PDFs, is still perhaps somewhat 
larger than that which we observe, but qualitatively more in line with it. 

We conclude that we do not support the conclusion that the treatment of NMC data 
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Figure 7: NNLO PDFs determined using NMC structure function data (long dashes) and no 
NMC data (short dashes) shown as ratio to the default NNPDF2.1 set (determined using NMC 
cross-section data) at = 10"* GeV^: up u{x, Q^) (top, left), gluon g{x, Q^) (top, right), antdown 
d{x,Q^) (bottom, left) and strange s{x,Q^) (bottom, right). 



may affect the Higgs cross-section and thus exclusion limits in any significant way. Of 
course, if the value of Ug is varied by a very large amount, then the cross-section and 
ensuing limits are significantly affected. In this respect, it should be noticed that the best- 
fit as{Mz) = 0.1135 value of Refs. [HIH] at NNLO differs by more than 7 sigma from the 
PDG value as(M^) = 0.1184 [13], in units of the latter's uncertainty Aa^ = 0.0007. The 
Higgs working group [T7] following PDF4LHC [25] , recommends to use a more conservative 
Aa^ = 0.0012, but even so the value of Refs. [HIH] differs by more than four sigma from 
the PDG average. A NLO determination of as based on the NNPDF2.1 PDF fit [26] 
leads to values of as which are in good agreement with the PDG value, both when the 
global dataset {as{Mz) = 0.1191) and deep-inelastic data only (as(M^) = 0.1177) are 
used. Given that, as we have just shown, the treatment of NMC data has no statistically 
significant impact on the NLO analysis, it is exceedingly unlikely that the NNPDF NLO 
determination of as might depend on how the NMC data are treated. It will be interesting 
to repeat the analysis of Ref. [26] at NNLO. The stability of NNPDF2.1 PDFs when going 
from NLO to NNLO [7] suggests that results should not change dramatically. 

In summary, we find that the effect of the treatment of NMC data on NNPDF2.1 
PDFs is of no statistical significance at NLO, and just about statistically significant at 
NNLO though by at least a factor three smaller than the typical PDF uncertainty due to 
propagated data uncertainties. The effect on the Higgs production cross-section is accord- 
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Figure 8: The Higgs cross-section in gluon fusion at the Tevatron Run II (top) and at the LHC 7 
TeV (bottom). Left: the reference NNPDF2.1 NLO (NMC structure functions) compared to the 
NLO fits with NMC cross-sections and no NMC data, shown as a ratio to the reference. Right: the 
reference NNPDF2.1N NLO (NMC cross-sections) compared to the NLO fits with NMC structure 
functions and no NMC data, shown as a ratio to the reference. All uncertainties shown are one 
sigma. 



ingly negligible. Even removing NMC data altogether has a moderate effect on NNPDF2.1 
PDFs, which even at NNLO remains below one sigma. The considerable stability of the 
NNPDF2.1 results is due both to the use of a very wide dataset which includes DIS, Drell- 
Yan, weak vector boson production and inclusive jet data, which reduces the dependence 
of our results on any particular dataset, and to the extremely flexible neural network 
parametrization which eliminates the parametrization bias which might otherwise lead to 
instabilities on small shifts in input data. 

Acknowledgments 

We thank S. Alekhin for correspondence and discussions on the matter of this paper. 
M.U. is supported by the Bundesministerium fiir Bildung and Forschung (BmBF) of the 
Federal Republic of Germany (project code 05H09PAE). This work was partly supported 
by the Spanish MFC FIS2007-60350 grant. We would like to acknowledge the use of the 
computing resources provided by the Black Forest Grid Initiative in Freiburg. 



10 



References 

[1] S. Alekhin, J. Blumlein and S. Moch, (2011), arXiv:1101.5261. 

[2] New Muon Collaboration, M. Arneodo et al., Nucl. Phys. B483 (1997) 3, hep- 
ph/9610231. 

[3] New Muon Collaboration, M. Arneodo et al., Nucl. Phys. B487 (1997) 3, hep- 
ex/9611022. 

[4] J. Baglio, A. Djouadi and R. Godbole, (2011), arXiv:1107.0281. 
[5] R. S. Thorne and G. Watt, (2011), arXiv:1106.5789. 

[6] The NNPDF Collaboration, R.D. Bah et al., Nucl. Phys. B849 (2011) 296, 
arXiv:1101.1300. 

[7] R. D. Ball et al. [The NNPDF Collaboration], (2011) arXiv: 1107.2652. 
[8] S. Alekhin et al., Phys. Rev. D81 (2010) 014032, arXiv:0908.2766. 
[9] S. Forte, Acta Phys. Polon. B41 (2010) 2859, arXiv:1011.5247. 
[10] J. Huston, talk at the PDF4LHC workshop, CERN, March 2011 

http : //indico . cern. ch/getFile .py/ access?contribId=2&resId=l&materialId=slides&confId= 127425 

[11] L.W. Whitlow et al., Phys. Lett. B282 (1992) 475. 

[12] L.W. Whitlow et al, Phys. Lett. 250B (1990) 193. 

[13] S. Bethke, Eur. Phys. J. C64 (2009) 689, arXiv:0908.1135. 

[14] F. Demartin et al., Phys. Rev. D82 (2010) 014002, arXiv: 1004.0962. 

[15] The NNPDF collaboration, R.D. Ball et al., Nucl. Phys. B838 (2010) 136, 1002.4407. 

[16] S. Alekhin et al., (2011), arXiv:1101.0536. 

[17] LHC Higgs Cross Section Working Group, S. Dittmaier et al., (2011), 
arXiv:1101.0593. 

[18] The NNPDF cohaboration, L. Del Debbio et al., JHEP 03 (2007) 039, hep- 
ph/0701127. 

[19] The NNPDF Collaboration, R.D. Ball et al., Nucl. Phys. B809 (2009) 1, 
arXiv:0808.1231. 

[20] The NNPDF collaboration, R.D. Bah et al., Nucl. Phys. B823 (2009) 195, 
arXiv:0906.1958. 

[21] The NNPDF collaboration, R.D. Ball et al., JHEP 05 (2010) 075, arXiv:0912.2276. 
[22] S. Forte et al., JHEP 05 (2002) 062, hep-ph/0204232. 



11 



[23] R. Bonciani, G. Degrassi and A. Vicini, JHEP 11 (2007) 095, arXiv:0709.4227. 
[24] U. Aglietti et al., JHEP 01 (2007) 021, hep-ph/0611266. 
[25] M. Botje et al, (2011) 1101.0538. 

[26] S. Lionetti et al., Phys. Lett. B701 (2011) 346, arXiv:1103.2369. 



12 



